Incremental CFG Parsing with Statistical Lexical Dependencies
Authors
Abstract
Incremental parsing with a context-free grammar produces partial syntactic structures for an initial fragment of a sentence on a word-by-word basis. Owing to syntactic ambiguity, however, too many structures are produced, and parsing therefore becomes very slow. This paper describes a technique for efficient incremental parsing using lexical information. The lexical information, namely the probabilities of dependencies between words, is acquired automatically from a large-scale corpus annotated with syntactic structures. A process for discarding unlikely syntactic structures has been integrated into incremental chart parsing: partial syntactic structures whose dependency probabilities are not high are removed from the chart. The technique can also be regarded as a practical method of incremental disambiguation. An experiment using the Penn Treebank has shown the technique to be feasible and efficient.
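The pruning idea in the abstract can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the abstract does not give the paper's probability model, scoring rule, or threshold, so the relative-frequency estimate, the product score, the `unseen` fallback value, and all function names below are hypothetical and are not the authors' method.

```python
from collections import Counter
from math import prod


def estimate_dependency_probs(pairs):
    """Relative-frequency estimate of P(head | dependent).

    `pairs` is a list of (dependent, head) word pairs, e.g. read off
    a treebank. This estimator is an assumption, not the paper's model.
    """
    pair_counts = Counter(pairs)
    dep_counts = Counter(dep for dep, _ in pairs)
    return {
        (dep, head): n / dep_counts[dep]
        for (dep, head), n in pair_counts.items()
    }


def structure_score(dependencies, probs, unseen=1e-6):
    """Product of the dependency probabilities in one partial structure.

    Pairs never observed get the small fallback value `unseen`
    (a hypothetical smoothing choice).
    """
    return prod(probs.get(d, unseen) for d in dependencies)


def prune(structures, probs, threshold):
    """Keep only the partial structures whose score reaches the threshold."""
    return [s for s in structures if structure_score(s, probs) >= threshold]


# Toy counts: "telescope" attaches to "saw" three times and to "man" once.
observed = (
    [("man", "saw")] * 4
    + [("telescope", "saw")] * 3
    + [("telescope", "man")]
)
probs = estimate_dependency_probs(observed)

# Two competing partial structures for the same sentence prefix,
# each represented only by its list of (dependent, head) pairs.
verb_attachment = [("man", "saw"), ("telescope", "saw")]
noun_attachment = [("man", "saw"), ("telescope", "man")]

kept = prune([verb_attachment, noun_attachment], probs, threshold=0.5)
```

In an incremental chart parser, a filter of this kind would run after each word is consumed, so that low-probability partial structures are dropped before they multiply on the next word.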
Similar resources
Dependency Parsing Resources for French: Converting Acquired Lexical Functional Grammar F-Structure Annotations and Parsing F-Structures Directly
Recent years have seen considerable success in the generation of automatically obtained wide-coverage deep grammars for natural language processing, given reliable and large CFG-like treebanks. For research within the Lexical Functional Grammar framework, these deep grammars are typically based on an extended PCFG parsing scheme from which dependencies are extracted. However, increasing success in ...
Incremental LTAG Parsing
We present a very efficient statistical incremental parser for LTAG-spinal, a variant of LTAG. The parser supports the full adjoining operation, dynamic predicate coordination, and non-projective dependencies, with a formalism of provably stronger generative capacity as compared to CFG. Using gold standard POS tags as input, on section 23 of the PTB, the parser achieves an f-score of 89.3% for ...
Incremental Parsing with Monotonic Adjoining Operation
This paper describes an incremental parser based on an adjoining operation. By using the operation, we can avoid the problem of infinite local ambiguity in incremental parsing. This paper further proposes a restricted version of the adjoining operation, which preserves lexical dependencies of partial parse trees. Our experimental results showed that the restriction enhances the accuracy of the ...
Incremental Parsing with Adjoining Operation
This paper describes an incremental parser based on an adjoining operation. By using the operation, we can avoid the problem of infinite local ambiguity. This paper further proposes a restricted version of the adjoining operation, which preserves lexical dependencies of partial parse trees. Our experimental results showed that the restriction enhances the accuracy of the incremental parsing. ke...
Statistical Parsing with an Automatically-Extracted Tree Adjoining Grammar
Why use tree adjoining grammars (TAG) for statistical parsing? It might be thought that its added formal power makes parameter estimation unnecessarily difficult; or that whatever benefits it provides—the ability to model unbounded cross-serial dependencies, for example— are inconsequential for statistical parsing, which is concerned with the probable rather than the possible. But just as TAG i...